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Abstract 

Faussurier et al. [Phys. Rev. E 65, 016403 (2001)] proposed 
to use a variational principle relying on Jensen-Feynman (or Gibbs- 
Bogoliubov) inequality in order to optimize the accounting for two- 
particle interactions in the calculation of canonical partition functions. 
It consists in a decomposition into a reference electron system and a 
first-order correction. The procedure appears to be very efficient in 
order to evaluate the free energy and the orbital populations. In this 
work, we present numerical applications of the method and propose 
to extend it using a reference energy which includes the interaction 
between two electrons inside a given orbital. This is possible thanks 
to our efficient recursion relation for the calculation of partition func- 
tions. We also show that a linear reference energy, however, is usually 
sufficient to achieve a good precision and that the most promising way 
to improve the approach of Faussurier et al. is to apply Jensen's in- 
equality to a more convenient convex function. 

1 Introduction 

The superconfiguration method [TJ is a powerful technique for the study of 
opacity and equation of state of hot plasmas. The formalism requires the cal- 
culation of the partition functions of the superconfigurations using recursive 
methods, and relies on independent-particle statistics. For that purpose, in 
the original STA (Super Transition Arrays) approach, two-particle interac- 
tions are averaged over the configurations of a given superconfiguration. The 
formalism proposed by Faussurier et al. [2] consists in an optimization of the 
orbital energies taking into account the interactions using a variational prin- 
ciple. This method reduces drastically the number of superconfigurations 
required for the convergence of STA calculations, due to a better treatment 
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of the electron correlations. In the present work, we investigate two possible 
improvements of the latter approach. 

The first one consists in using a reference energy which is quadratic with 
respect to the populations. This enables one to take into account the inter- 
action between two electrons belonging to the same orbital. Moreover, it can 
be easily understood that a quadratic reference energy is more appropriate 
for the description of a system including quadratic two-body interactions. 
We show, however, using a more general reference energy, that the linear 
reference energy gives results that are already very close to the exact values. 

The second improvement consists in applying Jensen's inequality to an- 
other convex function [3], which is the difference between the exponential 
function and its (2n — l)* ft -order Taylor development. Such a procedure 
requires the computation of high-order moments, but appears to be very 
powerful. 

The paper is organized as follows. In Sec. [2j we express the partition 
function of interacting electrons with an arbitrary reference system that can 
be optimized through a variational principle. In Sec. [3l we study the impact 
of choosing a particular reference energy on the accuracy of the method. We 
discuss the possibility, in our new recursion relation [HE], to use a non-linear 
reference energy, in order to better account for two-electron interactions. In 
Sec. IU we propose an improved Jensen-Feynman inequality. Sec. [5] is the 
conclusion. 

2 Statistics of interacting electrons 
2.1 Generalities 

The central-field model is the simplest approach for calculating the atomic 
structure of an interacting many-electron system. Each electron is assumed 
to move independently in a central potential that represents the electro- 
static field of the nucleus and the spherically averaged mutual repulsions 
of the other electrons. This central-field Hamiltonian allows one to char- 
acterize the quantum states of the system (energies and wavefunctions) by 
means of degenerate relativistic or non-relativistic electron configurations. 
The latter are defined as groups of degenerate subshells of the type {nili) Vi 
(non-relativitic case) or [nil{ji) Vi (relativistic case), where rii is the principal 
quantum number, 1^ is the orbital quantum number, ji = A i 1/2 the rela- 
tivistic angular momentum, and pi is the population of the i th orbital such 
that < pi < gi (the degeneracy gi is equal to Ali + 2 in the non-relativistic 
case, and to 2jj + 1 in the relativistic case). Since configurations differ only 
by the electron populations of orbitals, it is useful to represent them by the 
vector p = {p\,P2, ■ ■ - Pn} of N given orbitals. The accuracy of the central- 
field model can then be improved by evaluating the non-central parts of the 
Hamiltonian using the first-order perturbation theory. Accounting for one- 
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and two-body operators in the Hamiltonian of the system, the energy E(p) 
of a configuration reads 

N N 
i=l M=l 

where (energy of orbital i) and Vij (interaction energy between or- 
bitals i and j) are evaluated from the central-field Hamiltonian, and 6ij is 
Kronecker's symbol. 

The main purpose of this paper is to study the statistics of physical 
quantities over an arbitrary canonical ensemble (denoted £) of configura- 
tions, with Q electrons populating N orbitals. The set of configurations in 
£ is obtained by fixing or varying the population of each orbital, with the 
constraint J2i=iPi = Q- The average value of any quantity A(p) over all 
the configurations of £ reads 

W = urn £ m G{p) e ' Pm - (2) 

^ ' pes 

In this expression, U {£) is the partition function of the ensemble £ and 
is defined as 

U{£) = e-? F = G(p) e^ B ®, (3) 
pee 

where F is the total free energy and G(p) the degeneracy of the config- 
uration p: 

N / \ 

G ® -n(S)- (4) 

Such expression of the degeneracy involving binomial factors is the sig- 
nature of the Fermi statistics, i.e. of the proper accounting for the Pauli 
exclusion principle. It was shown in [6], using an integral representation 
of the partition function evaluated with the saddle-point method, that the 
average populations of orbitals obey Fermi-Dirac distribution in the limit of 
a large number of electron states. 

The sums contained in formulas ([2]) and ([3]) run usually over a very large 
number of electron configurations. A direct evaluation of these expressions 
is therefore a hard and sometimes an almost impossible task. Unfortunately, 
the quadratic dependence with respect to the populations of orbitals (due to 
two-body interactions) is known to prevent any factorization of the partition 
function. This drawback makes impossible the use of recursion techniques 
to speed up the calculation of averages. 
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2.2 Use of a reference system 

A way to avoid this factorization issue is to introduce an arbitrary reference 
energy Er{p), writing: 

U(S) = °(P) e-ft E *® +AE W (5) 

V 6 £ 

= (e-? AE ) R U R (£), (6) 

with AE = E(p) — Er(p). The letter R means that the assigned quantity 
is evaluated in the new reference system by replacing E{p) by E R (p). The 
notation (A)ji is thus defined as 

(A) R = -±- A(p) Gift e-? E «®, (7) 

where 

U R {£) = e~P FR = £ G (P) e"^*®. (8) 

V 6 £ 

The mean value of any quantity in the genuine interacting system can 
now be put in the form: 

which depends only on averages in the new reference system. 

The choice of the reference energy Er(p), though arbitrary, must en- 
able one to evaluate Ur{£) using recursion relations (factorizability) . This 
is discussed in Sec. III. Moreover, only averages of quantities which are 
multinomial functions of the populations {e.g. pi, p^-Pj, p\p 2 jP% etc.) can be 
expressed in terms of partition functions (see Appendix A). Therefore, an 
approximation has to be made for the exponential term in Eq. ([9]). This is 
discussed in the following subsection. 

2.3 Variational approach based on the Jensen-Feynman in- 
equality 

Jensen's inequality [7] states that for any continuous and convex function 
u — > f(u), one has 

(/(«)> > /««», (io) 



4 



which leads, for f(u) = e u , to 



-PAE\ >e -f3{AE) R (n) 
I R 



and therefore from Eq. 



U{£) > e-P^R U R (£). (12) 

Taking the natural logarithm of this expression, we obtain the first-order 
Jensen-Feynman (or Gibbs-Bogoliubov) [HI EJ [TO] inequality: 



F<F (1) , (13) 

where 

F« =F R + (AE) R (14) 

is an upper-bound approximation of the thermodynamic potential F. If 
the reference energy contains free parameters, they can be optimized through 
a minimization procedure, providing in that way a better approximation for 
F. 

It can be noticed that the Jensen-Feynman approach for deriving F^ is 
also consistent with the approximation e"^^ - ^^-^ ~ 1 in the expression 
of F. Performing the same approximation in Eq. ([9]) leads to 



(A)^(A) R . (15) 

3 First prescription: Choice of the reference sys- 
tem 

The choice of the reference system must allow one to factorize the partition 
functions in order to derive recursion relations. Recently, we have proposed 
an efficient technique [H [5] to compute the partition functions. It is based 
on a doubly nested recursion (on the number of electrons and orbitals), each 
orbital being added one after the other. Because of the orbital separability, 
this approach can be applied to a reference system with an energy of the form 
Er{v) = J2iL\ d(Pi), where d is an arbitrary function of the population of 
orbital i. Partition functions are then derived from the efficient recursion 
relation 

min(Q,g N ) 
Pjv=0 
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initialized with Uq. = 5q$. An important constraint of the recursive 
techniques is that only the average values of quantities containing integer 
powers of the populations can be deduced from the knowledge of partition 
functions (see Appendix A). 

In this section, we study the impact of choosing a particular reference 
energy on the accuracy of the method. A non-exhaustive list of reference 
energies is presented in Table [TJ 

In the reference energy A, the quadratic terms with respect to the pop- 
ulations are averaged over all the configurations. This is the method used 
in the original superconfiguration theory pQ. The case B corresponds to the 
work of Faussurier et al. : it consists of a linear form of the populations and 
has N degrees of freedom (namely {0i,i = 1,N}) obtained by minimizing 
the right-hand side of Eqs. (|13p . The reference energy C is an extended ver- 
sion of case B where a quadratic dependence of the populations in a given 
orbital is added. 

The case D is a power-law form which is introduced to infer some infor- 
mation about the optimal exponents in the reference energy. It requires 2N 
parameters 7$, i = 1, N}. It can be noticed that this choice prevents any 
closed- form evaluation of < AE >r in terms of recursive partition functions, 
because Er(p) is not a multinomial function. In this case, the calculations 
are performed "brute force" by summing over all the configurations. 

In case A (no adjustable parameters), the free energy is directly obtained 
by evaluating Eq. (I14p . In cases B, C and D, the free parameters are 
determined by minimizing the same equation. In practice, the minimization 
of the free energy is performed with a conjugate-gradient method by 
using the vector of derivatives with respect to the K free parameters {aj, i = 



which can be expressed analytically. For instance, for reference energy 
C, one has 



1,K}: 




(17) 



N 



1=1 




(18) 



and 
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i=l 



(PiPkiPk - - #i) 

AT 

+ 4 X] K^fo - S ij)) R (PkiPk ~ 1)) 



(Pi(jPj - $ij)Pk(Pk - l)) R ](Vij - faSij) 



(19) 



where the mean quantities are calculated from the partition functions of 
the reference system. 

For numerical illustration, we consider the case discussed in Ref. [llj . 
i.e., a copper plasma at T=100 eV and p=8.96 g/cm 3 . The statistics is 
performed over all relativistic configurations of the type 

K 2 L 8 (3s 1/2 3p 1/2 3p 3/2 3d 3/2 3d 5/2 4s 1/2 Ap 1/2 4p 3/2 ) Q , 

where Q electrons (which may vary from to 26) are distributed over the 
orbitals in parenthesis. The K (n = 1) and L (n = 2) shells are assumed to 
be full. The free energies presented and discussed below do not include the 
contribution of the frozen core. We can see in Table [2j for different values 
of the number of electrons Q (first column) , that the free energies when the 
interactions are artificially cancelled (third column) are very different from 
the exact values (second column). It is obvious that the inclusion of the 
interactions is crucial: the maximum error reaches about 65 % for Q=13 
(half the total degeneracy). When the interactions are averaged (fourth col- 
umn, reference energy A), the maximum error (still reached for Q=13) is 
reduced to 8 %. It is interesting to mention that the results do not change 
if the reference energy is shifted by a quantity which is independent of the 
populations (see Appendix B). The results can be improved by minimizing 
the free energy. For instance, with a linear reference energy and first-order 
Jensen- Feynman inequality (fifth column, with reference energy B), the 
maximum discrepancy between the obtained free energies and the exact val- 
ues drops to 0.03 %. One finds that the use of a quadratic reference energy, 
which introduces twice the number of free parameters (sixth column, refer- 
ence energy C) brings a slight improvement of the results which were already 
excellent using the reference energy B. This can be understood considering 
the "power-law" reference energy (seventh column, reference energy D). One 
finds that the exponents obtained after minimization are very close to one 
(see Table EJ for all the eight orbitals, which justifies the choice of the linear 
reference energy. One can also notice that the free energies obtained with 
the power-law reference energy D are very close to the ones obtained with 
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the quadratic reference energy C. This is another evidence that the linear 
part of the energy prevails over the quadratic part. 

Figures [T] and [2] display, for each orbital i, the ratio 8{/ei obtained af- 
ter minimization of the free energy with reference energy B and first-order 
Jensen- Feynman inequality (F^). One can see that for the highest-energy 
orbitals, 9i can be very different from ej, which can be explained by the fact 
that the electrons in such orbitals are very sensitive to electron-electron in- 
teractions, on the contrary to electrons in the lower orbitals which are more 
subject to the attraction of the nucleus. 

One can be surprised that some values of 9{ become positive (orbitals 
3d 3 / 2 an d 3d 5 / 2 , see Fig. [I]) for some values of Q. In fact, the reference 
energy B is the energy of a fictious non-interacting system. It represents 
both the linear part of the energy of the real system and the quadratic 
interactions, which are positive. Therefore, when the contribution of the 
quadratic interactions dominates, 9% can become positive. 

The strength of the method resides in the fact that, even when 9i differs 
notably from a, the populations of the corresponding orbitals are very close 
to the exact results, which was rather unexpected. 



4 Second prescription: higher-order Jensen- Feynman 
inequality 

A better approximation may be found by applying Jensen's inequality (I10p 
to the convex function 



fn(u) 



2n-l u 

EVT 

k=0 



(20) 



with u = —f3[AE(p) — (AE(p)) R ]. We obtain a new Jensen-Feynman 
inequality [3] with an adjustable precision driven by n, which reads 



(21) 



where 



F (2n-i) = Fr + (AE) R - - In 



P 



2n-l 



k=2 



k\ 



(22) 



and M k = ([AE - (AE) R ] k ) R . The expression of F( 2n ~V is more 
and more complicated as the order n increases, but all the quantities M^, 
k=l,- ■ ■ , 2n — 1 can still be obtained using our efficient recursion relations. 
The last column of Table [2] shows the values of the free energy developed at 
the third order (i.e. n = 2) and minimized with the linear reference energy 
C. We can see that, even with this simple reference energy, the obtained 
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free energy is always closer to the exact value than . The precision 
reached by increasing the order of Jensen- Feynman inequality is better than 
the one obtained within changing the reference energy and/or increasing the 
number of free parameters. We have checked that an arbitrary precision can 
be obtained by going to higher order (n=3 or more). However, it must be 
clear that the computational time increases rapidly with n due to multiple 
evaluations of _p( 2n_1 ) by the minimization routine. 



5 Conclusion 

In this article, it was proposed to account for two-body interactions in the 
calculation of partition functions in the canonical ensemble. Based on the 
work of Faussurier et al. relying on Jensen-Feynman (or Gibbs-Bogoliubov) 
inequality, we showed that, thanks to our recently published recursion rela- 
tion, it is now possible to use a quadratic reference energy, which takes into 
account electron-electron interaction inside a given orbital. The required 
new quantities were presented in a compact form. It was shown, however, 
using an optimized-exponent reference energy, that a linear reference energy 
is usually sufficient to achieve a high precision. Finally, we found that the 
approach of Faussurier et al. can be improved by applying Jensen's inequal- 
ity to the difference between the exponential function and its (2n — l)* ft -order 
Taylor development. In this case, the expression of the free energy to be 
minimized is more complicated, but the quantities (high-order moments) 
involved can still be obtained from our robust recursion relations. 



6 Appendix A: Averaging process 

When a function A{p) contains only powers of the populations, it is easy 
to show that its average value (A(p)) R , defined in Eq. (J7|), can always be 
calculated from the knowledge of partition functions, whatever the form of 
the reference energy. This is due the following relation between binomial 
coefficients: 

'©-'©-'C; 1 )- (23) 

This allows one to write, for instance, 



<Pi)R = 9 ' V'WM m 



and 
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(Pi(Pj -dij)) R = 9i(9j ~ kj) x 



+ 



^Q;iv[5] ^jvk] ) ' 



(25) 



where notation g 1 ^'" means that the partition function is evaluated with 
the degeneracy of orbitals i, j, k, etc. reduced by one. 



7 Appendix B: Invariance with respect to a trans- 
lation of the reference energy 



Let us consider the reference energy 

E R , c (p) = E R (p) + C, 
where C is a constant. One has 



(26) 



Fr,c 



P 



In 



p G £ 



p G £ 



■PE Rl c(P) 



-l3E R (p) 



+ c 



= F R + C. 

Moreover, for any quantity A (see Eq. (0), one has: 



(27) 



R,C 



1— E A(p) e-W*. 



c(p) 



Ur,c(£) 



p € £ 



-i- £ ^(p) e-^W 
^ ' p e 5 



(28) 



Therefore, one obtains 



(£ - E R)C ) R)C = (E- E R>C ) R 

= (E-E R ) R -C. (29) 

In the same way, one can check easily that the quantities do not 
depend on C. This leads to 
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F (2n-1) = Frc + {E _ Erc)rc 

2n-l „ k 



1 

1 



111 



fc=2 

Fr + (E — E R ) R 

2n-l 



fc=2 



(30) 



which means that the quantity F^ 2n ^ does not depend on C, whatever 
the value of n. 



8 Appendix C: Average-atom orbital energies 

There is an error in [2], which has not been properly corrected in the erratum 
[12j . In the original paper [2], formula (9) reads: 

4 on = -e a -E(K)o-^7)A a7 . (31) 

7 

In the erratum [TJJ, the authors suggest to replace that expression by 

C" = "€a " E(W) " WJ«)V. ( 32 ) 

7 

which is also wrong. The correct expression is: 

4™ = -e« - E K) (1 - S aj /g a )A aT (33) 
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System 


Reference energy Er(p) 


free parameters 


degrees of freedom 


A 


N 
i=l 


1 N 


none 





B 




IV 
i=l 




m 


N 


C 


N 

E 

1=1 






m,{M 


2N 


D 




TV 
i=l 




m,H} 


2N 



Table 1: Reference energies considered in this paper: A) quadratic part 
of the energy averaged; B) linear reference energy; C) quadratic reference 
energy; D) power-law reference energy. The free parameters in cases B, C 
and D are determined by minimizing the free energy. 



13 







F 


Q 


Exact 


Exact with Vij = 


2 


-810.1430 


-829.6907 


7 


-1813.4044 


-2187.1193 


13 


-1878.6536 


-3115.7014 


19 


-914.4643 


-3419.8402 


25 


1377.7764 


-2911.9648 







F (i) 




F(3) 


Q 


A 


B 


C 


D 


B 


2 


-809.9468 


-810.1370 


-810.1385 


-810.1386 


-810.1430 


7 


-1786.6201 


-1813.1371 


-1813.2003 


-1813.1985 


-1813.4034 


13 


-1716.0766 


-1878.0163 


-1878.1632 


-1878.1531 


-1878.6451 


19 


-612.4660 


-914.2243 


-914.2825 


-914.2815 


-914.4636 


25 


1474.4451 


1377.7764 


1377.7764 


1377.7764 


1377.7764 



Table 2: Free energy (eV) for Q=2, 7, 13, 19 and 25 electrons in orbitals 
(3s 1/2 , 3p 1/2 , 3p 3/2 , 3d 3/2 , 3d 5/2 , 4s 1/2 , 4p 1/2 , 4p 3/2 ) for a copper plasma 
at T=100 eV and p=8.96 g/cm 3 , calculated using the reference energies A, B, 
C and D of Table [TJ The free energy is developed either at the first (F^) or 
third (F^) order of the Jensen- Feynman approach. The second and third 
columns of the first sub-table above contain respectively the exact values 
and the values obtained when the interaction energies Vij are artificially 
cancelled. 
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Q 


3si/ 2 


3i?l/2 


3^3/2 


3^3/2 


3d 5 /2 


4si/ 2 


4i>l/2 


4p 3 /2 


2 


1.001 


1.001 


1.001 


1.002 


1.002 


1.023 


1.026 


1.028 


7 


1.002 


1.002 


1.004 


1.004 


1.006 


1.020 


1.023 


1.032 


13 


1.002 


1.004 


1.008 


1.007 


1.012 


1.017 


1.022 


1.056 


19 


1.001 


1.001 


1.006 


1.003 


1.012 


1.026 


1.032 


1.133 


25 


1.013 


1.009 


1.008 


1.007 


1.004 


1.029 


1.042 


1.187 



Table 3: Values of exponents ji obtained after minimization of the free 
energy within first-order Jensen- Feynman inequality, using reference energy 
D:£f =1 ^, 
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J I I I I I I I 

5 10 15 20 25 

Number of electrons (Q) 



Figure 1: (Color online) Ratio 6%/ei for orbital % (3s]y 2 to 3d 5 / 2 ) obtained 
after minimization of the free energy within first-order Jensen-Feynman in- 
equality (fl~3|) . using reference energy B. The ratios for 3pi/ 2 (thin line) and 
3p 3 /2 (crosses) are almost indistinguishable, as well as the ratios for 3d 3 / 2 
(dashed line) and 3p 5 / 2 (plus symbols). 
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20 - 
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5 10 15 20 
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Figure 2: (Color online) Ratio 9i/e{ for each orbital i (4s 5 / 2 to 4j> 3 / 2 ) 
obtained after minimization of the free energy within first-order Jensen- 
Feynman inequality (|13|) . using reference energy B. 
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